An Efficient Block-based Dynamic Range Adjustment Method in Noise-robust Continuous Speech Recognition
Authors
Abstract
This paper proposes a new technique for estimating speech features in noisy conditions, yielding noise-robust continuous speech recognition (CSR). Noise-robust techniques for isolated-word speech recognition typically employ running spectrum analysis (RSA), running spectrum filtering (RSF), and dynamic range adjustment (DRA). Among them, only RSA has been applied to a CSR system. We propose an enhanced DRA for noise-robust CSR: in the recognition stage, the continuous speech waveform is automatically divided into short blocks, and DRA is applied to each block. We find that the proposed method improves recognition performance under several different noise types and SNR conditions.
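The block-wise idea described in the abstract can be illustrated with a minimal sketch. It rests on assumptions the abstract does not confirm: DRA is taken to mean scaling each cepstral coefficient's trajectory by its maximum absolute value, and the blocks have a fixed length (`block_len`, a hypothetical parameter), whereas the paper divides the waveform automatically by a procedure not described here. The function names `dra` and `block_dra` are illustrative only.

```python
import numpy as np


def dra(features, eps=1e-8):
    """Scale each coefficient trajectory so its peak absolute value is 1.

    features: array of shape (frames, coefficients).
    Assumed form of DRA; the paper's exact formula is not given here.
    """
    peak = np.max(np.abs(features), axis=0, keepdims=True)
    # eps guards against division by zero for all-zero trajectories.
    return features / np.maximum(peak, eps)


def block_dra(features, block_len=50):
    """Apply DRA independently to consecutive blocks of frames.

    block_len is a hypothetical fixed block size; the paper's automatic
    block division is not reproduced in this sketch.
    """
    features = np.asarray(features, dtype=float)
    out = np.empty_like(features)
    for start in range(0, len(features), block_len):
        stop = start + block_len
        out[start:stop] = dra(features[start:stop])
    return out
```

Normalizing per block rather than over the whole utterance means each block's peaks reach the same range regardless of how loud or noisy neighboring blocks are, which is the property a continuous-speech setting would need.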
Similar resources
A Noise-Robust Continuous Speech Recognition System Using Block-Based Dynamic Range Adjustment
SUMMARY A new approach to speech feature estimation in noisy conditions is proposed in this paper. It is used in noise-robust continuous speech recognition (CSR). Running spectrum analysis (RSA), running spectrum filtering (RSF), and dynamic range adjustment (DRA) have been developed as noise-robust techniques for isolated-word speech recognition. Among them, only R...
Improving the performance of MFCC for Persian robust speech recognition
The Mel frequency cepstral coefficients (MFCCs) are the most widely used features in speech recognition, but they are very sensitive to noise. In this paper, to achieve satisfactory performance in Automatic Speech Recognition (ASR) applications, we introduce a new noise-robust set of MFCC vectors estimated through the following steps. First, spectral mean normalization is a pre-processing step which applies to t...
Robust Speech Recognition with MSC/DRA Feature Extraction on Modulation Spectrum Domain
This report introduces noise robust speech recognition and proposes advanced speech analysis techniques named MSC (Modulation Spectrum Control)/DRA (Dynamic Range Adjustment). The dynamic range of cepstrum obtained from noisy speech is usually smaller than that from the same speech without noise since some speech features are hidden in noise. This difference may cause recognition errors. Theref...
A new method for missing-data-based robust speech recognition using a bidirectional neural network
Performance of speech recognition systems is greatly reduced when speech is corrupted by noise. One common approach to robust speech recognition is the missing-feature method. In this approach, the components of the time-frequency representation of the signal (spectrogram) that have a low signal-to-noise ratio (SNR) are tagged as missing, deleted, and then replaced by remaining components and statistical ...
Compression of Model-based Group Delay Function for Robust Speech Recognition
In this paper, we improve the performance of the ARGDMF [3] feature by adding a nonlinear filtering block. ARGDMF is a group delay-based feature consisting of four main parts, namely autoregressive (AR) model extraction, group delay function (GDF) calculation, compression, and scale information augmentation. The main problem with the GDF is its spiky nature, which is solved by coupling the GDF wit...